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On the Conditions under which the "Probable Errors" of 
Frequency Distributions have a Real Significance. 

By L. Isserlis, B.A. 
(Communicated by Prof. Karl Pearson, F.B.S. Received June 21, 1915.) 

"When we seek the value of a statistical constant, we may either consider 
the whole aggregate of individuals possessing characteristics of which the 
constant in question is a function, or we may limit ourselves, from choice or 
necessity, to the consideration of a random sample of the whole population. 
The mean height of Englishmen of military age, at a given instant, is a 
constant which could be determined by considering the whole population, but, 
in practice, would be determined from a random sample. On the other hand, 
the mean weight of adult herrings frequenting the North Sea is necessarily 
to be determined only by a consideration of a sample of the whole population. 

Statistical constants calculated from a sample give us little information 
unless we know, at the same time, the manner in which the values may be 
expected to vary from random sample to random sample, i.e. the frequency 
distribution of the constant in many samples. The universal custom is to 
state the " probable error "* of the constant, which is equivalent to giving 
the parameter of the normal curve of errors representing the frequencies of 
the deviations of the values of the constant, calculated from the various 
samples, from the value of the constant in the population as a whole. The 
parameter — the standard deviation of the frequency distribution — therefore 
ceases to provide an adequate description of the facts if the frequency 
distribution differs sensibly from the normal. 

Now any statistical constant may be determined by means of the moment 
coefficients of its frequency distribution, and it is well known that the mean 
and the standard deviation, i.e. the first and second moment coefficients of a 
frequency distribution, themselves have frequency distributions which rapidly 
approach normality with the increase in the size of the sample, at any rate, 
when the sample is itself of small size compared with the size of the whole 
population. 

When a frequency distribution ceases to be normal, it can, in the 

* The statement that a statistical constant has a value C with a " probable error " E 
means that if the constant be determined many times from random samples of the 
relevant material, the value obtained will differ from C by less than E, in half the 
number of cases. E is always taken equal to 0"67449 times the standard deviation • 
this is based on the assumption of normal distribution, and it is this value of E that 
ceases to have real significance if the distribution be not normal. 
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overwhelming majority of cases, be represented by one of Pearson's well known 
frequency curves, the constants of such a curve requiring in addition to the 
mean and standard deviation, a knowledge of the /3-coefficients /3i = /is 2 / ^ 
and ft 2 = {*4:/v>2 2 , where fi3, /n^, are the third and fourth moment coefficients. 
For a Gaussian distribution /3\ is zero and /3 2 is three. I propose, in the 
present paper, to discuss the values of these /3- coefficients for a moment 
coefficient of order u in the case of a sample of size n extracted out of a finite 
population of size N". It will appear that in the case of moment coefficients 
of moderate order, /3i and /3 2 approximate to their normal values as the size 
of the sample increases, especially if the sample is itself small compared to 
the general population. Tor coefficients of high order,* however, the sample 
has to be an inconveniently large fraction of the population itself if /3i and 
/3 2 are to approach even approximately to their Gaussian values. The usual 
" probable error " ceases for such moment coefficients to have Gaussian 
significance and statistical constants, founded on such moments, cease to be 
of value, unless they are accompanied by the constants of their skew 
frequency distribution. 

It is to be hoped that the values of /3 h /3 2 , which are here given without 
approximation, for moments of any order, may prove useful in applications 
in which n/~N is not small and moments of high order necessarily occur. The 
results of the present paper suggest, however, that there is little value in 
graduation formulas which, thanks to the employment of high moments, 
represent a given sample with photographic accuracy. 

Now the moment coefficient of order u obtained from a sample of n drawn 
from a population of E* is given by the formula 

fjf u = S(n s x s ll )/n, 

where n s is the frequency of the group of character x s in the sample, x s being 
measured from some fixed origin. The mean value of (jl u or pf u will clearly 
be the value of // w for the population N", or 

£'„ = S(N A *)/ff, 

where N" s is the frequency of the group of character x s in the whole popula- 
tion 1ST. 

djjf u = fi'u—fiu = ${(n s — ~N s n/N)x s u }/n = S(dn 8 . x s u )/n, 

where dn s = the deviation of n s in the sample. 

We have now to find the moments M/, M 3 ', M 4 ', of the distribution of 
dn' u to ascertain the values of ft' = M 3 /2 /M 2 /3 and fy' = M 4 '/M 2 ' 2 and 
ascertain how these approach the values zero and three, which would accord 
with normality as the value of n is increased. It is clear that we shall 

* For moments of odd order (3 1 tends to zero, however high the order may be. 
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require for this purpose a knowledge of the mean values of (dm s ) 2 , (dn s ) s , 
(dn s y, as well as the mean values of various products of the deviations of type 
dn s dn S j and in general of products of type (dn s )\dn s ) m ..., of degree not 
higher than the fourth. 

In the next place when the moment coefficients /x u are taken about the 
mean of the sample, so that 

fiu = 1/n . S{n s (w s — xf) 
and 

djx u = 1/n . S{dn s (x s — x) u }~- udx/A u _i, 

we shall require for the determination of the moments M 2j M 3 , M 4 of the 
deviation d/u u , a knowledge of the mean values of products (of degree not 
higher than the fourth) of type 

(dn 8i )\dn s J f m ...(dx) k . 

We proceed in the next two sections to a systematic investigation of the 
mean values of products of these tw T o types. 

2 (i). The Moment Coefficients of the Group Frequencies. — If a sample of n 
be extracted from a population of N containing Np with and "Nq without a 
given characteristic A, then the moment coefficients are the moments of 
the terms of the hypergeometrical series 

p X(j?N-l)...(pN-n+l) y( R n m 

N(N-l)...(N-* + l) *W>?> 1 >> W 

where a — — n, /3 = — qN, 7 = p~N— n + 1. (2) 

The moments of this series are known to be as follows :* 

N— n 
M2 = npq ^— p 

, x Is r — n N— 2n 
fis = npq(p-q) 



y"4 






-f N[3pgn(6 — n) + 1 — 6ft] + 6rt 2 (l — 3^)}. 

In the case of a character occurring in a group 1ST, of N", p — N s /lSr and 
g = 1 — Ns/N", whence by substitution in the above values we obtain the 
moment coefficients of ?i s occurring in the sample ft. 

Let us write 

N"— ft _ ]sr--2ft _ , N— 3ft __ „ {9 

* 'Phil. Mag., J 1899, pp. 236-246, Prof. Karl Pearson, "On Certain Properties of the 
Hypergeometrical Series." 
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and ^ = 2-4 x ' + 3 x " + %7»i-6 x '7» ) (4) 

*7» = -i+3 X '-V- 3 x7»+4x7«- (5) 

When the sample is small compared with the total population, so that 
nfN is negligible, we have 

X = % = X = x > 

^ = 1 — 2 j n and $' = 1. 

Then the mean value of <i% 2 is %^ . Ni/N" . (1 — Ni/N"). 

Or Mean efoii 2 = %ni(l—ni/n), (6) 

to the best of our knowledge, as %/?i is the best value we can put for Ni/N. 
Similarly- 
Mean value of dn x s = xx n i0-~~ n if n )(l"~~% n i/ n )> CO 
Mean value of (Z% 4 = %%(1— ^i/^){3^i(l— %/ft) + <£'}. (8) 

(ii) 7Ae Product Moment Coefficients of the Group Frequencies, — Let a popula- 
tion N" contain Ni individuals of type 1, IS 2 individuals of type 2 and 
N—Ni — N 2 others. 

A sample of size n being taken, the probability that it contains s\ individuals 
of type 1, s 2 individuals. of type 2, and n~s l -rS2 others, is clearly: 

n\ (NA(N2).(N"-Ni-N 2 )n-.-. 

(N), l_*ii*! ''(N-Ni-N 8 -» + l) fi+ 

= (N-Ni-Na). f [«]« > [« / ]»,[^1«, + 
(N)» I s 1 !s 2 ![ 7 ] s+s J* 



iV 



where 



1 2 



(A) A = 7t(A-i)(A-2)...(A-*+l), 
[A]* = A(A + l)(A + 2)...(A + fc-l) > 
while cc = — Ni, ^ = — N" 2j /3 = — ra, 7 = N — N"i— N" 2 — w + 1. 

Thus the product moment coefficients are those of the terms of the double 
hypergeometrical series 

H«, «', A 7 , l, l) = S 2 ^ Ca M^ 



^ * 2 Si ! s 2 ! [7]^+*, 



The product moments of this series have been calculated by the author* 

* ' Phil. Mag.,' 1914, pp. 385-387, L. Isserlis, " Application of Solid Hypergeometrical 
Series to Frequency Distributions in Space." 
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and the values required in the present paper can be obtained by replacing 
the quantities, 

r } r', n, q and p of the ' Phil. Mag.' article by 

Ni, N~ 2 , N", njTS and 1,— ?i/N respectively. 
We thus obtain the following additional mean values — 

Mean value of dnidri2 = — x nin2 / n > 00 

Mean value of dni 2 dn 2 — -~xx nin2 / n • (1 — 2?ii/w). (10) 

It was unnecessary for the purpose of fitting the double hypergeometrical 
series to calculate the product moments ^31, Pis and ^22, but as the mean values 
of dni s dn 2 , din\dn£ and dn^dni are required in the present investigation, w r e 
will indicate briefly the methods by which these product moments are 
obtained. The notation is that employed in the ' Phil. Mag.' article previously 
quoted. Operate on Equations (11) and (12)* with 0<j> 2 and <£ 3 respectively 
and we obtain 

%22 + Xi3 + Xia( m i — 3) — Xw( r + !) + %02(^2 — '% + r + 2) 

= %22(^~ ^') + %13 ?, + %12( — ?' — W 2 + f'~4 (A-) 

and %i3(^ — r') + ?*%04 + %oa( — • ?" — ^2 + f' — ?0 = 0. (B) 

Divide by xoo and put x = y = 1, then equation (13) becomes 

Z/13(7& — ^ / ) + ^04+I'03(— • ?*— ^2 + ^- %) = 0, 

or replacing the raw moments by their equivalents in terms of moments 
about the mean 

fxis + 3d'/ii2 + ^03 + 3d' 2 /j,n + odd'fjL 2 + ^ /3 ) (n — • r ') 

+ 0*04 + 4^ V03 + 6d' 2 fiQ 2 + d f y + (^03 + 3d> 2 + ^' 3 )( - r - ?w 2 + r' - w) = 0. 
If we now use the moments already found, we obtain on reduction 

/*i3 = —rl(n—r f ) . /* 04 . (O) 

If we treat (A) in the same way, it becomes 
fL2 2 (n— r' — l) = /*i 3 (l— r)4-/Ao3[^(l— r) — (r + 1)] 

+ A t i2[3d'(l— r)-Hi — ^?i --3+ rqn—r f — 2d(n — r f — 1)] 

+ M- 2^-^-1)] 

+ i a 02 [3^ / (l-r)-3^(l-fr) + (l-fr)(^+2) 

+ tf (w — £w — 3 + rqn — r') — d\n»— r ' — 1) ] + y t6 2 o[ — d f \n — r f — 1) ] 

+ f M 11 [3d\l-r) + 2dXn~qn-3 + rqn-r / )-MdXn-r , ~l)] 

-r (1 —r)ddfz + (n — qn—3 + rqn — r')dd /2 — (r + l)d' s 

+ (r + l)(qn+2)d /2 -(n~r' -l)d 2 d /2 . 

* < Phil. Mag.,' 1914, p. 380. 
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A similar equation will hold in which the suffixes and accents are inter- 
changed, i.e. fjihk is replaced by fi^, r by r\ and d by d r . Let this second 
equation be subtracted from the one last written; the factor r — r will 
cancel, and, after considerable reduction, we find 

*" = (n-l)(^2)(n-3) [n,i>g " n,{1+fg(r + r ' )} 

-+■ n{ 1 + (2 — 6pg) (r + r') -f Sjoqrr } — 6rr'(l — 3p#)]. (D) 

Prom (C) and (D), we find 

Mean value of dn 1 d dn 2 — — % -^[3</mi(l — nijn)-\-^> f '\. (11) 

Mean value of dni 2 dn 2 2 = %%?i2 [<£ (1 — n\jn — n 2 jn + 3nin 2 /n 2 ) + $' jri\. (12) 

(iii) We shall require also the mean values of dn\dn 2 dn^ dni 2 dn 2 dn^, and 
dn\dn 2 dnzdn±. 

Now, for given values of dn\ and rf^ 2 , the mean value of dn% will be given 
by the regression plane 

Mean dn» = ri3 ~ n2 ? s f"»d Ml + ^zTj^ig ^^ 

Now from (6) and (9) a n 2 — ^ni(l — ni/n) 

and r V 2 2 = nin 2 /(n — n{)(n—n 2 ). 

Hence mean dns for constant dtoii, dn 2 = - ; 3 ' — 9 — - — i — > 

l—ni/n — n 2 Jn 

and mean dn\dn 2 dn? y = r-^ — (mean ^% 2 ^2 + mean d%\dn 2 2 ) 

l—ni/n — n 2 /n 

2nin2n 'i X X- ( 13 ) 
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The mean value of dn^dn 2 dn% is similarly seen to be 

~~~J l *l n — (mean dn^dn 2 + mean dn{ 2 dn 2 2 ), 

l — ni/n-~n 2 /n 

and by the use of (11) and (12) we find, after some reduction, 

Mean value of dnfdnrfn* = -y<£ ^^ (1 - — *). (14) 

n * n 

Finally, to find the mean value of dn\dn 2 dn%dn^ we use the regression 
equation 

Mean d%\ for constant dn 2 , dn% t dn±, = — ~^ — dn 2 — — — dn%~ ~Ji~ dn±* 

* ' Phil. Trans.,' A, vol. 187, p. 302. 
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where B M denotes the minor (with its proper sign) of r m in the determinant 

B = 1, ris, ri 3 , r 14 , 

^21, 1, ^23, ^24, 

^31* ?"32, 1, ^34, 

r 4 i, r 42 , r 43 , 1. 
Using the values of the o-'s and r's as above, we find mean value of 

= — (mean dn^dn^dn^ -\-meeni dn 2 dnz 2 dn± -f mean dn 2 dn%cln^) f 

and with the aid of (14) this reduces to 

mean value of dnidn^ln^dn^ = %yj$>n\ntfizn±\ vt? . (15) 

When the total population N" is infinite, the frequency distributions can 
be represented by binomial series in place of the hypergeometrical series 
employed above. If, in addition, we suppose the sample n to be so large 
that l/n s may be neglected, we get the case considered by Mr. H. E. Soper* 
in his discussion of the probable error of the correlation coefficient, and we 
may note, as a check on the results (6) to (15), that when we put 
% = %' = % r/ = 1, </> = l — 2/n, $' = 1, and neglect terms in 1/w 3 , they 
agree with the values in equations (xviii) to (xxvii) of his paper. For the 
special case of a normal distribution, the mean value of dni 2 dn 2 2 can be 
deduced from a result published by Prof. Karl Pearsonf 

^22 — 3^20^02 = (^ — 1)2^02^20, 

leading to ^22 = 2tii?i 2 % 2 (1 — ni/n — n 2 jn 4- 3nin 2 /n 2 ), 

agreeing with our equation (12). 

We may note also that the methods of the latter part of this section may 
also be employed to obtain the mean values of d?iidn 2 and dn^dn 2 directly 
from the single hypergeometrical series, but fail to give mean value of 
dni 2 dn 2 2 . This necessitates the employment of the double series. 

3. We are now in a position to calculate the exact values of the moments 
M 2 ', M 3 ', M/ and the j3 coefficients B/, B 2 ' of the distribution of d/u' Ui the 
characteristics x\, x 2 , ..., being measured from some fixed origin 

ja'u = S(n s x s u )/n, 

.*. d/j,' u = S(dn s x s u /n), 

■ ' * ' Biometrika,' vol. 9, p. 95 (1913), H. E. Soper, M.A., " On the Probable Error of 
the Correlation Coefficient to a Second Approximation. 

t ' Drapers' Company Besearch Memoirs,' XI Y, " On the General Theory of Skew 
Correlation," p. 20, footnote. 
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and 



or 



M 2 ' = mean value for all samples of {S(chi s x s u )} 2 /n 2 , 
M 2 ' = mean value of-- {S(dn s 2 x s 2u )-j-2S(dn s d?i t x s u x t u )} } 



1 



=72 S tX^C 1 ~~ n s/n)x s 2u — 2xn s n t x s u x t u /?i], 



using (6) and (9), 



or 



n 



(16) 



For the third moment coefficient about a fixed origin, we have 
Ms = mean value of (dfi'yf, 

1 
= mean value of — [$(dn s % Su ) + 3S(dn s 2 dn t x s 2u x t u ) + 6S(rZ^ s ^??.^??.^/^^ w )], 

using (7), (10), and (13), this becomes 



n 6 



S{n s (l-n s /n)(l - 2n s jn)x s ^} 



3S<!^1 2n 



n 



n 



t)x 8 **xt»\ + 12S rl^x s u xfx h 



u 



reducing to 



Thus 



B/ 



Ma 7 = ^ {^ , 3 u-3^V / B + 2^ B 3}. 

= Ml 2 = i£(^-VV«+V» s ) 2 

M 2 ' 3 » % 



(17) 



(18) 



If the population out of which we draw the sample is large compared to 
the sample, we may put ^ = ^' = 1, and (18) may be put in the form 



n[${n s (x s — / u ) 2 }Y' 
The fourth moment about a fixed origin is 

M 4 ' = mean value of (dfj,' u Y = mean value of -> — -v^ 



(19) 



W 



4 



mean value of 



1^ 

n 4 



S(dniW u ) + 4$>{dnihln 2 x^ u x 2 u ) 4- ^(dnxhlnix^x^) 

4- 1 2S(dni 2 dn 2 dn3Xi 2u X2 u x d u ) -f 24S(^i^i 2 ^^3^k^i M ^2^3 M ^4 w ).- 



We insert in this expression the mean values of the fourth order moments 
and product moments of the frequency deviations given by (8), (11), (12), 
(14), and (15), and find 



of Frequency Distributions have a Real Significance. 31 



M/ = M S 



(wi 4 #i 4m 4- 6?ii%2 2 ^i 2l ^2 2w 4 12?li 2 n2^3^1 2M ^2 M «^3 M 



,4 



+ 4fti 3 W 2 #l 3M #2 M + 24?li^2%W 4 a? 1 M ^ 2 M e«3 M ^4 M ) 

2 

(??i 3 #i 4m 4- 2n 1 2 n 2 x 1 * u x 2 u 4- n{ 2 n 2 Xi 2u x 2 2u + 2nin 2 n B z 1 2u x 2 u X3 u ) 



n s 



+ -p(^i 2 A 4M + 2tii??2^i 2m ^2 2m ) 



?i 2 



or 



4- ^§- S [ — n^x^ u jn 2 4- nix^ u jn — ^nin 2 x^ u x 2 u j n 2 4- §n\n$c-? u x£ u j n 2 \ 
M/ = ^[(^ , ,) 4 ™2( / x / 2,)(M / w ) 2 + (^) 2 ] 



w 2 



+ 2^ [3(m'3u) 2 -4/a V'« + AJ- (20) 



w 3 



This can be rewritten in the form 
3- 



M*' = ^(<f> + 2<l>'fn) 



n 2 



4- 3^2- [// 4m — 4// 3m/ u/ m 4- 6/t' 2w ^V 2 — 3^V— 3(/i'2w— ^V) 2 ]. (21) 

But we have already shown that 

Ma^X^-^V), (16.) 

therefore the " second /3 " of the distribution of the moment of order u about 
a fixed origin is given by the elegant formula 

p ,_ 3(<ft4-2<ft7*0 , ft 7 / S{^(^/— m 7 m ) 4 } q\ / 99 v 

% ^%\[S{%(^~^) 2 }] 2 V ( ^ 

Now for an infinite population $ = 1 — 2/n and 0' = 1= ^. For such a 
population, therefore, 



B f Q 
2 — O = - 



s{^/-/Q- 



L[S{M^-A) 2 }] 2 M i (23) 

Let us put u = 1 in equations (19) and (23) and we obtain the well known 
result : — 

If Bi, B 2 &e 27&0 constants of the distribution of the means of samples of n 
drawn at random-, corresponding to j3\, ft 2 in the original frequency distribution 
(with infinite population), then 

B x = fii/n, Ba-3 = (/8 2 -3)/ti.* 

In other words the Bi and B 2 tend to the normal values and 3 respec- 
tively as the size of the sample increases. 

* Henderson, K,, ' Journ. Inst, of Actuaries,' vol. 41, pp. 429-442 ; of. also " Student," 
i Biometrika/ vol. 7, p. 210. 
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4. With a view to their use in investigating the values of the moments M 2 , 
M3, M u , and the ^-coefficients Bi and B 2 of the moment coefficients fi U} no 
longer about a fixed origin, but about the mean of the sample, we proceed in 
this section to the calculation of the products of various powers of the 
deviations in the frequencies of one or more of the characters and various 
powers of the deviation in the mean. 

nx = %\X\ + n 2 x 2 + %a% + . .'. 
. • . ndx = X\drii + x 2 dn 2 + . . . 
n 2 dx 2 = $(xi 2 dni 2 ) 4- 2S(xiX 2 dnidn 2 ), 
.*. mean dx 2 = ^S[xi 2 n(l—7ii/?i)/7i 2 ] — 2'%$(n 1 ?i 2 x 1 x 2 fn s ) i 
by (6) and (9), 
or mean dx 2 — %(/a'2— m' 3 )/^ = X^f n - (24) 

When N" = 00 this reduces to the well known formula for the probable 
error of a mean : 

<?x = o'xjx/n. 

Of course the result in (24) can be obtained directly by putting u = 1 in 
equation (16). In the same way we obtain directly from (17) and (21) the 
results : — 

Mean value of dx s = 2£^ « 3 . (25) 

Mean value of d& = % (<£ + 2^/w) + ^ (/**- 3/* 2 2 ). (26) 

it lb 

Mean value of dn^dx 

= mean value of \x\dn-fjn 4- d?ii(x 2 dn 2 /n -f x^cln^jn + . . . )] 

by (6) and (9) 



% 



#1 / 1 _ % \ ^2 ^1^2 ^3 %% 



.w \ n 1 n n n n 

We shall find it convenient, and an aid to brevity to denote #1— #, 
<r 2 — #, etc., by Xi, X 2 , ..., hence 

mean value of dn^dx = ^— Xi. (27) 

Mean value of dn^dx 

= mean value of [xidn 1 3 /n + dn 1 2 (^x 2 dn 2 /7i + xsd7is/n-{- ...)] 

= %%'[^i^i(l — ni/n)(l — 2ni/n)—x 2 n 1 n 2 (l — 2ni/n)/n-~ ...] by (7) and (10) 

= £%' Wl - 2n 1 /n)X 1 . (28) 
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Mean value of chiidx 2 = — = mean value of (&'idni + x 2 dn 2 + . . .Yd^ 

I 

— — mean of x^dn^ 4- 2xidn 1 2 (x 2 dn 2 4- #3^3 4- . . . ) 

+ dni(2x 2 X'sdoi 2 dns +...)• 
Using (7) and (10) and (13), this reduces to 

%%' -^ [ — 2 ( ! — 2wi/w) ^(x—n&i/n) + a?i 2 (l — %i/w)(l - 2%,%) 



w 



(// 2 — ni%i 2 -fn) 4- 2(« 2 — 2n 1 Xi 2 jn 2 — 211^2X1X2/ n 2 — ...)> 



= %%' ^t-^'a + ^ + C^-*) 8 ] 

^^gCXi 2 "^). (29) 

The mean value of dnxdn 2 dx 

= mean of [#idfoi 2 ^ 2 4- x 2 dn 2 2 dni 4- dn\dn^xzdn^+ x 4 dn^ + ...)]> 
and using (10) and (13) this reduces to 

%% / ^^[~^( 1 ^ 2 ^i/^)~-^(l--2^2A0 + 2 /^ • (Wfe+w^ +...)] 

= -^^(Xi + Xa). (30) 

For the mean value of dfoi 3 d& we use (8) and (11) and find mean of 
L^L (^dtoi 4. ^«2 4- . . . ) 

or Mean d^dS = £ [3<£%(1 — %/w) 4- <£'>iXi. (31) 

The mean value of d?ii 2 dn 2 dx 

= mean of — ±— J [fadm 4- ^w 2 ) + #3^ +...], 
using equations (11), (12), and (14), we get 

- ~ (x w W^)[3^ 1 (l-%/%)4-0 / ] 

71' 



+ ^x^i^C^i-wi/w-V^+s^^M+^M] - ^%^^ 4>(i-3m/n) 



x± n 1 n 2 n 4c JL/1 , x 

VOL. XOII. — A. 



D 
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j? 



and obtain after a further reduction 



ni%2 



Mean dm 2 dn 2 dx = % — [{^(l-^W + fWX 2 -(2%/^fWXi]. (32) 



The mean value of dmdn 2 dn 3 dx 



= mean of - \dmdn 2 dn^{x\dm + x&in2 + xsdns) + dmdn 2 dn^{x^dm + ...)] 



x4> 



n x n 2 n z 

n 



X\ 



x 2 



%z 



±i(l_3%/ra) + - 2 (l-3Wu)+ ^(1-W») 
_n n % 



o 



0X4. 



n 



^ -mfn+... by (14) and (15), 



reducing; to 



Mean dmdn 2 dn^dx = — %</> 



nin 2 ns 

n 2 



(X! + X 2 + X 3 ). 
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The mean value of dm 2 dx 2 



mean of 



<$% 



c?w 



^! 2 — §- -f 2xi — i- {xtfLn* + ...) + x 2 2 dm 2 dn 2 2 + xidn^dni + . . . 



%• 



w 



+ 2x 2 xzdm 2 dn 2 dn% 4- . . 



With the aid of equations (8), (11), (12), and (14) this can be reduced to 



X$ 



w 



3% 2 (1 —ni/n) 2 Xx 2 4 nin 2 (l -~~nifn-~~n 2 fn 4 Zmn 2 \n 2 )x 2 2 4 . . . 



L (1 —nijn)xi{n 2 x 2 4 %^ 3 4 , . . ) 

V± (1 — 3m In) (n 2 n 3 x 2 %s 4 w 2 w 4 a?2#4 4 . . . ) 

4 %jL [m(l —ni/n)xi 2 4 mn 2 x 2 2 jn + . . . — 2mn 2 XiX 2 jn — 2%\n^x\Xz\n —-...], 
so that 



^1 



Mean value of dm 2 dx 2 = % — [^2 {$(1 -"*"*• %/?0 + </>7^} +X 1 2 (2<j&%/^ + 07^)]« 



w 



(34) 



The mean value. of dmdn 2 dx 2 



= mean of 



dmdn<< 



7W 



[(xidm 4 x 2 dn 2 ) 2 4 2{x x dm 4 #2^2) (^3^3 +...)+ ^ 2 ^3 2 + . . • 

4- 2^x^dnsdm + ...]. 
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To reduce this we use (11), (12), (14), and (15), obtaining 



Xy 



2 I 



TV 



2 



n 1 n 2 ,< 



% ~^— - (Scj>ni(l — %\jn) + <f>') 



n 






n 2 



n x n 2 



IV 



uX\X 2 



$(1— - nij% — %%\% J r s &n 1 %2l f n?) 4- — 



**l~ 



2«i 



n* 



2x 



[ ~ ^3%<£%^2%/^ ~" obxfaiWH/n — . . . ] (1 — 3^/w) 



+ — i [^^3X#i%%/^~^4%^>%%%/^--...](1---3^2/^) 



w 



tffc 



from which we deduce 



^0 — — ^ i _ 0^3/ ft) — - . . . H ~~ x<P ni^ntfHj n +..., 



%\% 2 



Mean value of dn x d% 2 dx 2 = ^ — ?[0(2XiX 2 — yct 2 )-~ (Xx— X 2 ) 2 ^>7^]- (^^) 



n 



Finally we shall require the mean value of dn\dx 2 , i.e. of 



w 



'- [xiHtv^ -f o%i 2 (x 2 dni s dn 2 + ...)•+- %x\{x£d%\ 2 dn£ + . . .) 



+ 6%(^2a?3(cZ% 1 2 ^2^^3 + ...) + x 2 Hnxdn 2 z + . . , -f- Sx 2 2 xsdMidn 2 2 dn^ -h ... 

This can be reduced by means of equations (8), (11), (12), (14), and (15). We 
give the final result 



n t 



t % 



Mean value of dnxdx 3 = 3%<£ ~ ^ 2 Xi -f %$' -~ [Xi 3 + 3/^ 2 Xi— /i 3 ]. (36) 



n 



TV- 



5. We proceed to apply the results of the preceding section to the calcula- 
tion of the constants of the distribution of a moment coefficient /u u when it is 
taken not about a fixed origin, but about the mean of the sample. 

We have 

fi u = i S {n s (x s ~-xf} = * SKX."), 



<i,a 



ft 
1 



ft 



ft 



or 



1 



6?/^ M = - S{dn s X s u } ~udx{i u -i. 

lb 



(37) 



Ms 



mean value of (dfi u ) 2 



mean of 



ft' 



5 {S(^ 1 3 X 1 2w ) + 2S (rf%rf%Xi"Xa tt )} 



2«/i»-i {s^^Xi")} +wV«-i*?" 



ft 



D 2 
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Substituting the mean values found above, of chii 2 , dnidn 2 , dn^dx and dx 2 we 
get 



M 2 = ^s 



n 



X 



.n 



(l—n 1 /?i)X ] 



2u 



'Z?iin 2 ~v 



ni 



w 



pW-%-1- 1 X^ 1 



n 



+ 2CV 



n 



P u-l[*>2 



= - [M<2u — f* 2 u — 2Uflu- l/Jm + 1 + U 2 fJ> 2 u-l/J>2]> 



n 



(3.8) 



Secondly M 3 = mean value of (df.t u ) 3 , and by (37) 

_ 3^-2 g^^x^ + 2^i&i^Xi w X 2 w ) 

^2 



0% /^/ m— 1 






71 
^ 3w 



S(d%idx 2 Xi u ) — u s fj? u - idx 3 , 



""" " " o"" -*■*■■*- X I 



- 2 Xi^Xa* + 12nil y n * Xi"X 2 u X 8 



w 



w 



w- 



3-- 2 



^ X x ?«- ^?? X^Xa"* -X^ 






j« 



9, 



n x n 2 



n 4 



Xi u X 2 u+1 






W 



% 



?h 



-Xi» +a -/i 2 ^Xi M 






^2L [ 2/X 3 w — 3fJLu/J4u + /A3« — 3 W/Am- i(/A2tt + 1 — 2yU- M + 1^ % ) 



n 

Thus 

Bi-= 



(39) 



M 2 3 

/2 



= Al_ { 2 L6 3 M — Sfl u fM2u + /*3w — 3^/^- i(y^2w+ 1 — ^f^u+lf^u) 

n x 

+ 3lt 2 JJL 2 u ~i(fl u+2 — fJ, 2 fjC u ) — U d /JL 3 u -i^ S } 2 ^ 

(/*2u — fJ?u— 2ufJ>u-lflu+l + U 2 f*> 2 u-lH>2f. 

For M 4 we have again, from (37), 
1 



(40) 



(<W 



7l 4 



[S(dwiXi«)] 4 






4 ^r x S [^(^X^ 4- 3^% 2 ^ 2 X 1 2w X 2 w + 6dihdn s dn B X 1 »X 2 u X z u )\ 
fe ^ M " 1 S[^ 2 (^ 1 2 X 1 2w + 2^ 1 fZ^ 2 X 1 w X 2 M )] 

76 
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and M 4 = mean value of (d^y for all samples. Now the mean value of the 
first term is clearly obtained by dropping the accents in (20), and so equals 

The mean value of the second term is 

+ 3%?^ [ {0(l-^/7i) + ^ , MX 3 -(27ii^/w + ^7%)Xi]Xi a «X 2 ' 



n 
n\n^riz / - v - , v - , v xv MV - mY ^ w 



- 6 % <£ ^2 ( Xl + X 2 + X 3 )X 1 «X 2 "X 8 
The third term has for mean value 

§U 2 fJL 2 



n 2 



L ^ \ w / ^ n n 



UKJ)' v 2w+2 2%% 2 



+ ^lVi_ Xi 2 W+ 2 + 



71 ?l "" ^ 2 



cf>(2X 1 X 2 ^fi 2 )X 1 u X 2 ^ £ (Xi-Xa^Xi-Xa- 



w, 2 






>. 



The fourth term has a mean value 



: £"IL %[3^2/^ W +l+™ (/Att+8+3yt*2ftM+i— ^3A*m)], 



^2 • — - • ■ 7l 



whilst the mean value of the last term is given by (26) and equals 



«V«-i -f ^ M2 2 + ^ (a*. + 3rf)\ . 



C w 2 ' n s 

On adding the various terms together, we find that M4 can be written in 
the form 

M 4 = X ( </, + 2 1) pa + *£ (Q _ 3 F), (41) 



where P = /A 2w -^ 2 -2%-i i w K+1 + %y M _i/i2, (42 

Q = ^ u — 4/x 3w yt% + S^ufiu 2 — 3/x M 4 

— 4^/X w _i(/X3 M+ i — 3/J,2u + l/^u ~ 3/X M + \[Xu) 
+ 6^ 2 /X 2 M -i(yL62w+2 ~ 2fAu+2fJl<u + ^2/^ 2 ) 

— 4:U S fl\- l(/U M+3 — flSfJiu) + ^^u-l^) (43) 
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From equation (38) we see that M 2 = ^ P. (44) 

n 

Hence B 2 = •=-— 

Mo 2 



31+ + &) 



+ &(§. 3). (45) 



Also (40) may be written 

Bi = 2q£ (46) 

where E = fj, Su — 3/^ M /^4-2//, 3 w — 3ufiu-i(fi2u+i — 2fJ>u+iMu) 

+ 3w 2 ^ 2 M -i(A^+a--At K /A2)—w 3 ^ 3 M -i^3. (47) 

6. When % is small compared with N 

so that (46) and (45) become 

Bl = ^' Ba " 3 = KS~ 3 )' 

and Bi, B 2 rapidly approach the normal values zero and three as n increases, 
provided E 2 /P 3 and Q/P 2 — 3 do not become large. Now we cannot calculate 
either of these expressions unless we know the actual distribution, but, for 
large samples, they occur as coefficients of the small quantity ljn and we 
may get a good idea of their order of magnitude by considering their values 
f or the case of a normal distribution. Now for such distributions 

fi2u+i = 0, (48) 

and i±2u = (2to—l)fi 2 fi2u-2 = ^~u — 77? ^^ 

Hence for odd values of u 
E = 0, 

Q = (/*4« — ^A t w-lA t 3w+l)--3^/£r M --i(/i3 tt+ i---^/A M -i/X-2 M + 2) 

For even values of it 

P = M>2u—Pu> (5 2 ) 

E = (jJfa — fJtoUw) — 2fa(/J.2u-~f*u)> (53) 

Q = (M4w — fJ>9uf*u) — Ofl u (^3u — H>2upu) + 3 / U M 2 (/^2w ~ H>u)- (54) 
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To calculate these values in numerical cases, we use, when u is odd, the 
formula : — 



/x 2 ^-2)l f 



(u — l)u...(w — l) 



it(w—it)l 



1 vry nT~il"2"J"-V2 / I 



w—u- 




(55) 



and when 16 is even 



Mw f^uf^w— u 



tA2 w/2 (w — U — l)l f (W — U)(W — U+1)...(W — 1) 



2v>I2-2 ( w ~ u — l\\i 2( 



W — U\(W — 16 1 \ /W -, 



.2 



(^-l)h 

' l)!J( 56 ) 

both being simple consequences of equation (49). 

In practice moments of higher order than the fourth are rarely employed. 
The following Tables give the values of P, Q, E, for the moments up to the 
sixth order and the resulting values of Bi, B2. 

Table I. 



u 


P. 


Q. 


R. 


2 
3 
4 
5 
6 


2ju 2 2 
6/W* 

96 M2 4 
720 fxi 

10,170 jU.2 6 


60 ix 2 4 

3,348 M2 6 
1,907,712 /u 2 s 

545,875,200 ^ 10 

3,141,804,591 ^ 2 12 


8ju 2 3 
zero 

9,504 ji^ 6 
zero 

33,998,400^ 



Table II. — Values B b B 2 for the Frequency Distributions of the First Six 
Moment Coefficients about the mean of samples of n out of a population 
of N, approximated by using normal values in the coefficients of 1/n. 



u. 


Bj. 


B,-« (♦ + **!). 

x \ ^ / 


2 

3 

4 

O 

6 


X » 

zero 

X' 2 102* 
X » 

zero 

x ' 2 549* 
X » 


</>' 12 
X » 

^ 90 

X ^ 

(/>' 204 
X n 

</>' 1050 

X » ! 

4/ 3035* 

X ^ 



* These are approximations to 102-^\, 549 '4448, 3034*646 . 
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It must be remembered that when the ratio of n to 'N, i.e. size of sample to 
total population, is small, the quantities ^, %',.<£/ and 4> + 2fi/n all approach 
unity. The deviation from normality of the fifth and sixth moment 
coefficients is considerable, even for samples of 10,000, and the /3 2 of the 
fourth moment differs very significantly from three for samples of 1000. 

The size of the sample is more influential than the ratio of the sample to 
the population. Consider a sample of 1000, extracted from a population 
of 1,000,000, to six places of decimals: % = 0-999001, #' = 0*998002, 
fi = 0*998002, so that to three places: — 

<f> + 2cj)'/n = 0-999, x ' 2 /x = °* 99 ^ ¥lx = °' 999 > 



and 



3 



~(<f> + 2(j)'/n) = 3-000. 



For such a sample the /3-coefficients of the first four moments are given 
in the following Table : — 



Table III. — Approximate "Values of /3 h /3 2 for Samples of 1000 out of a 

Population of 1,000,000. 



11. 


ft. 


2 
3 

4 


0'0( 


o-u 



ft. 



3-012 
3-090 
3*204 



To three places of decimals, the values in this Table agree with the values 

we should obtain by putting x = X = %" = 1 > or > for a sam P le of 1000 » a 
population of 1,000,000 may be considered infinite as far as its influence on 
the first four moments is concerned. The first four moments are those most 
frequently employed and a sample of 1000 is not unusually small — it is, 
therefore, of great importance to notice that for such a sample the frequency 
distribution of the fourth moment is decidedly skew. 

7. Finally, it is interesting to examine the values we obtained in (19) and 
(22) for the B/ and B 2 ' and /3 ? s of // M , the uth moment coefficient about a 
fixed origin, for the case of an infinite population. 

They may obviously be written : — 

Bi' = „&/**, (57) 

B 2 '-3 = (w32-3)/?i, (58) 

where M /3i, and M /3 3 are the ^-coefficients of a frequency curve obtained by 
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distorting the frequency curve of the original distribution, the distortion 
consisting in raising each ordinate to the power u. 

It seems probable that the values Bi = E 2 /%P 3 and B 2 ~3 = (Q/P 2 — 3)/n 
which are given by (45) and (46) for an infinite population admit of a similar 
interpretation. In fact (52), (53), and (54) show that this is the case for u 
even, when the original distribution is normal, but the general case appears 
at present to be intractable. 



Some Experiments on the Properties of Dielectrics. 

By Spencer W. Kichardson, M.A. (Camb.), D.Sc. (Lond.), formerly Principal 
of, and Professor of Physics at, the University College, Southampton. 

(Communicated by Sir Joseph J. Thomson, O.M., F.E.S. Eeceived July 19, 1915.) 

Introduction. — The phenomenon of the residual discharge has attracted 
the attention of many eminent physicists, and a large number of papers on 
this subject have been published. 

In 1854 Kohlrausch* showed that the instantaneous discharge was 
apparently independent of the residual discharge and that, for a given jar 
left to itself for a given time after charging, the residual discharge was 
proportional to the initial potential ; and in 1873 Maxwellf put forward the 
view that the existence of the residual discharge could be explained by the 
assumption that it was due to the heterogeneity of the medium exhibiting it. 

In 1881 Eowland and Nichols! asserted that Iceland Spar did not exhibit 
the phenomenon of the residual discharge, and claimed that its absence was 
a confirmation of Maxwell's view. I, however, having performed a consider- 
able number of experiments on Iceland Spar, have found evidence of the 
phenomenon in this case. The specimen I have studied * was a carefully 
polished slice — cut parallel to a cleavage plane — in which no flaw of any kind 
could be detected by eye. This specimen was silvered on both surfaces and 
round the edge — a gap being left between the silvered part of each surface 
and the edge for insulation (see fig. 1). 

The charge, as it appeared on one of the surfaces, was conducted away to a 
subsidiary condenser of much greater capacity than the specimen, and the 

* ' Poggendorff's Annalen,' 1854. 
t < Treatise,' vol. 1, 1873. 
X 'Phil. Mag., 5 1881. 



